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(57) Abstract 

A method and apparatus are provided for compressing data 
received from a digitizer tablet based on the characteristics of each 
individual stroke contour (150, 160). The digitizer tablet samples 
the position of the writing pen, continuously transmitting data to a 
computer in the form of x and y coordinates plus an indicator of 
whether or not the pen is touching the surface of the tablet (100, 
1 10). After preprocessing, a determination is made of which points 
along the stroke contour are essential to the reconstruction of a 
quality facsimile of the original writing from the sampled points 
(140). This determination is based upon local curvature, local ex- 
terna, and the endpoints of the stroke. In an optional second stage 
of compression, a standard mathematical compression technique 
is applied (160). 
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A METHOD AND APPARATUS FOR COMPRESSION OF ELECTRONIC INK 

BACKGROUND OF THE INVENTION 

The present invention relates generally to data 
capture applications, and more specifically/ to the compression 
of data generated on a digitizer tablet. 

When something is written on a digitizer tablet, the 
output is a parametric representation of the writing 
trajectory; that is, the writing is represented as a series of 
x f y coordinate values as a function of time. A typical 
digitizer tablet samples the position of the writing pen 100 
times per second, at fixed intervals of 0.01 seconds. The 
tablet continuously transmits the resulting data to the 
receiving computer. Thus every 0.01 seconds, the tablet 
samples the pen position and transmits an x-coordinate value, a 
y-coordinate value, and an additional value which indicates 
whether the pen is up or down. The data generated in this 
manner is commonly referred to as electronic ink. 

There are applications where it is of interest to 
capture and save the electronic ink. For example, it may be 
desirable to save an electronically generated signature for 
later signature recognition. Another application is the 
permanent storage of data which has been generated on a 
digitizer tablet. 

The problem with saving electronic ink is that the 
storage requirements can be quite large if the original sampled 
data is stored directly. Therefore there is a need for a data 
reduction technique for electronic ink that can achieve high 
data compression and yet be efficient to compute. The 
computational efficiency is important because electronic ink 
may be used on computers with relatively modest CPU power. 
Even systems of greater power might be significantly slowed if 
a complex decompression technique had to be applied to each pen 
stroke. (Note: a "stroke" is defined as the continuous segment 
which begins when the pen touches the digitizer tablet's 
surface and ends when the pen is lifted from the surface.) 



WO 94/03853 PCT/US93/06883 

2 

There are a number of methods for data compression. 
The earliest methods were based on the idea of minimum 
redundancy coding. The basic idea was very simple; symbols 
which occur frequently are coded with fewer bits than those 
5 which occur more rarely. The earliest known method to apply 
this technique was the Shannon-Fano coding. A subsequent 
development was Huffman coding, which is similar to Shannon- 
Fano coding in many ways. Huffman's contribution was in 
devising a very qlever and elegant way to construct the coding 

10 and decoding trees. In general, simple Huffman coding does not 
achieve the same degree of compression as more modern 
techniques, but processing and memory requirements are 
relatively modest. 

A further enhancement of Huffman coding is called 

15 adaptive Huffman coding. A basic problem with standard Huffman 
coding is that the decoding program must have access to the 
Huffman coding statistics. Hence, trying to increase 
compression with more comprehensive statistics is eventually 
self-defeating since more and more coding statistics must also 

20 be saved, which works against the compression. Adaptive 
Huffman coding gets around this problem by adjusting the 
Huffman decoding tree on the fly, effectively gaining the 
advantages of higher order modeling with no additional storage 
for statistics. 

25 Huffman coding requires that an integral number of 

bits be used to code symbols. Arithmetic coding improves 
compression by using a fractional number of bits per code. 
Although arithmetic coding will always provide as good or 
better compression than Huffman coding, its disadvantage is a 

30 significantly larger computational burden. 

Dictionary coding is another class of data 
compression. In general, these techniques create a dictionary 
of symbol sequences or combinations which are then accessed by 
indexing into the dictionary. These techniques have become the 

3 5 standard for compression on small computers because they 
combine good compression with modest memory requirements. 
Examples of codes using these techniques which are applicable 
to MS-DOS include PKZIP, ARC, ARJ, and Lharc. 



10 
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There are also compression techniques that apply 
primarily to specialized domains. An example is compression of 
two dimensional images using the discrete cosine transform and 
its many derivatives. 

A method and apparatus for compressing electronic ink 
which achieves high compression while requiring minimal memory 
and CPU power is needed. 

SUMMARY OF THE INVENTION 
According to the invention, for a computer with a 
digitizing tablet, a method and apparatus are provided for 
compressing data based on the characteristics of each 
individual stroke contour. 

The digitizer tablet samples the position of the 
15 writing pen a pre-determined number of times per second and at 
fixed intervals. This data is continuously transmitted to a 
computer in the form of x and y coordinates plus an indicator 
of whether or not the pen is touching the surface of the 
tablet. 

20 In the preferred embodiment the incoming data from 

the digitizer tablet is preprocessed. This preprocessing, 
which is application and/or system specific, is not part of the 
compression. Examples of preprocessing include smoothing for 
noise reduction, endpoint filtering, discarding duplicate data 

25 points, median filtering, Hanning filtering, spatial sampling, 
and size normalization. 

In the first stage of compression a determination is 
made of which points along the stroke contour are essential to 
the reconstruction of a quality facsimile of the original 

30 writing from the sampled points. The method proceeds by 

calculating descriptors and finding essential features of the 
stroke contour. In the preferred embodiment, the descriptors 
are local curvature, local extrema, and the endpoints of the 
stroke. Although the actual algorithm is somewhat detailed, in 

35 general the idea is that the extrema are saved, the points 

about which the local curvature is above a threshold are saved, 
and the endpoints of the stroke are saved. 
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* ** In the second stage of compression any number of 

standard compression techniques can be applied including 
Huffman coding, adaptive Huffman coding, arithmetic coding, and 
a variety of dictionary compression techniques* In the 
5 preferred embodiment differences of the form Ax = - and 
A Y = Yi ~ Yi-i are calculated and the stage 2 compression is 
applied to the differences. 

BRIEF DESCRIPTION OF THE DRAWINGS 
10 Fig. 1 is a flow chart of the general compression 

method according to the invention. 

Fig. 2 is a flow chart of the extrema determination 

step. 

Fig. 3. is a flow chart of the stroke endpoint 
15 determination step. 

Fig. 4 illustrates the local curvature calculations. 
Fig. 5 is a flow chart of the duplicate data points 
preprocessing step . 

Fig. 6 illustrates the endpoint filtering 
20 preprocessing step. 

Fig. 7 is a flow chart of the endpoint filtering 
preprocessing step. 

Fig. 8 is a flow chart of the median filtering 
preprocessing step. 
25 Fig. 9 is a flow chart of the Hanning filtering 

preprocessing step. 

Figs. 10A-C illustrate the reconstruction of an 
electronically written signature without applying this 
invention's compression techniques. 
30 Figs. 11A-C illustrate the reconstruction of an 

electronically written signature after applying this 
invention's compression techniques. 

Figs. 12A-C illustrate the reconstruction of a 
sequence of lower case letters after applying this invention's 
35 compression technique. 

Figs. 13A-C illustrate the reconstruction of a simple 
line after applying this invention's compression technique. 
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Figs. 14A-E illustrate the trade off between 
compression ratio and reconstruction fidelity. 

Fig. 15 illustrate the apparatus to accomplish the 
disclosed method. 

5 

DESCRIPTION OF THE SPECIFIC EMBODIMENT (S) 
The flow chart of Fig. 1 illustrates the compression 
method described by this invention. Data is input by writing 
on a digitizer tablet (step 100). The tablet represents the 

10 writing as a series of x and y coordinate values as a function 
of time (step 110) . This stream of x,y position coordinates 
along with a pen up/pen down indicator for each coordinate pair 
is continuously sent to the computer (step 120). 

In the preferred embodiment, prior to data 

15 compression, the data is preprocessed (step 130) . The exact 
form of preprocessing will vary according to the application 
and system. Preprocessing is generally used to remove noise. 
The details are unimportant as long as a good representation of 
the written input data is maintained. Another optional step 

20 which is dependent upon the application and system is the 

removal of duplicate or close points (step 140) . This step is 
generally necessary when the digitizer tablet's user writes 
slowly. Slow writing causes oversampling resulting in duplicate 
or close points which are redundant and do not carry 

25 significant information. 

The first stage of compression is based on the 
characteristics of each individual stroke contour (step 150) . 
In the second stage of compression any of the standard 
mathematical compression techniques can be applied (step 160) . 

3 0 Although this staige of compression is not required by the 
invention, it is employed in the preferred embodiment as a 
potential means of further compressing the data. 

After the completion of the second stage, a 
comparison is made between the first stage and the second stage 

3 5 compression (step 170) . If the compression was not improved 
after applying stage two, then the stage one compression data 
is saved (step 180) . If the data was further compressed by 
stage two, then this compression data is saved (step 190) . 
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The computer program modules used to compress the 
electronic data are given in the appendix. Significant program 
modules and their specific functions are as follows: 

5 Stage one compression: 

calc_sav.c Function to determine which points in the 

signature need to be saved, based on various 
criteria. 

curvature. c Function to calculate the local curvature at a 
10 point along the contour. 

Stage two compression: 

cmpsig2.c Function to compress an array of stage 1 data 

with characteristics of the signature 
verification template data. 
15 pack_half bytes. c 

Function to pack the value to be compressed into 
half bytes. 

set_4_bits Function to pack values into the upper or 
lower 4 bits of a byte. 
2 0 get_4_bits Function to unpack values from the upper or 

lower 4 bits of a byte. 
uncmpsig2.c Function to uncompress an array of data 

compressed by the function cmpsig2.c. 
unpack_half bytes . c 
25 Function to unpack and reconstruct the value 

from half bytes. 



Figs. 2-4 are detailed illustrations of the stage one 
compression methods. Fig. 2 shows a flow chart of the first 

30 step of compression in which extrema are determined and saved. 
The digitizer data is in the form of a sequence of coordinate 
values, x i and y i# as a function of time (step 210). The 
subscript i varies from i=0 to i=nsamps-2 where nsamps is the 
total number of data points in a stroke. The subroutine is 

35 initialized by setting i equal to 1 (step 220) . 

Minima and maxima are determined by comparing- the 
value of each data point to the values of the data points 
directly adjacent to it (step 2 30). To be a maxima, the value 
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*of either the x coordinate or the y coordinate must be larger 
then the values of the corresponding coordinate of the data 
points on both sides of the data point under review (step 230) . 
To be a minima, the value of either the x coordinate or the y 
5 coordinate must be smaller then the values of the corresponding 
coordinate of the data points on both sides of the data point 
under review (step 230) . 

If any of the four relationships defined by step 230 
are met (step 240), then that data point will be saved (step 

10 250) , otherwise the data point is not saved. After determining 
whether or not to save a data point, the value of i is 
increased by one (step 260) and a determination is made as to 
whether or not all data points have been reviewed (step 270) . 
If all data points have been reviewed then this subroutine ends 

15 (step 280), otherwise the subroutine loops back to the 

comparison step (step 230) , continuing until all data points 
have been reviewed. 

Fig. 3 shows a flow chart of the second step of 
compression in which stroke endpoints are determined and saved. 

20 The digitizer data is in the form of a sequence of coordinate 
values, x i and y i; as a function of time (step 310) . The 
subscript i varies from i=0 to i=nsamps-l where nsamps is the 
total number of data points in a stroke. The subroutine is 
initialized by setting i equal to 0 (step 315) . 

25 This subroutine first determines whether or not the x 

and y coordinates of the data point indicate that it is the 
first data point of the stroke (step 320) . If it is the first 
data point, then it is saved (step 3 30) . Next the subroutine 
determines whether or not the x and y coordinates of the data 

30 point indicate that it is the last data point of the stroke 
(step 340). If it is the last data point of the stroke, then 
it is saved (step 350) . After determining whether or not to 
save the data point, a comparison is made to determine if all 
data points have been tested (step 3 60) . If they have been 

35 then this subroutine ends (step 370), otherwise the value of i 
is increased by one (step 3 80) and endpoint determination 
continues. 
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Fig. 4 illustrates the calculations used to determine 
the curvature at each data point between the stroke endpoints. 
The stroke contour is represented by line 410 while the 
individual digitizer data points are represented by points 420. 
5 Point 430 is data point i, the data point of interest in this 
illustration. The coordinates of 430 are x^ and y^ 

In Fig. 4 line 440 is defined as the line segment 
connecting point 430 with point 425. The coordinates of point 
425 are x i-d and Y^ d , where d is equivalent to 3. Line 450 is 

10 defined as the line segment connecting point 430 to pbint 435 
where point 435 has coordinates x i+d and yi+ d . Curvature angle 
460 is the angle defined by lines 440 and 450. 

During stage one compression, the curvature angle for 
each data point is calculated. If the calculated curvature 

15 angle is within a certain range of angles, the data point is 
saved. If the angle does not fall within the pre.-selected 
range of angles, the data point is discarded. The choice of 
the variable "d" as well as the range of threshold angles is 
determined by the application's requirements. In the preferred 

20 embodiment, the value of d is 2; data points which have a 

curvature angle between 5 degrees and 25 degrees are retained 
for applications requiring high fidelity, low compression 
(e.g., signature verification); and data points which have a 
curvature angle between 25 and 50 degrees are retained for 

25 applications requiring low fidelity, high compression (e.g., 
simple data storage) . 

Figs. 5-8 are detailed illustrations of optional data 
preprocessing methods. Fig. 5 is a flow chart of the method by 
which duplicate data points are discarded. The digitizer data 

30 is in the form of a sequence of coordinate values, x^ and y^, 
as a function of time (step 510) . The subscript i varies from 
i=0 to i=nsamps-l where nsamps is the total number of data 
points in a stroke. The subroutine is initialized by setting i 
equal to 1, x sav equal to x Q and y sav equal to y Q (step 520). 

35 The current data point, represented by x.^ and y^, is 

compared to the previous data point, represented by x„„ and 

** sav 

y sav ( ste P 53 °) • If the values for the x and y coordinates are 
identical, then the current point is discarded (step 54 0) and 
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"the next point is compared to x sav and y sav . This process 
continues until either all of the data points have been 
reviewed (step 560) or until x ± and y i do not equal x sav and 
y S av respectively. When the two data points are no longer 
5 equivalent, x sav is set equal to x i and y sav is set equal to y i 
(step 545). The value of i is then increased by one (step 550) 
and the new value of i is compared to the total number of data 
points (step 560) . The subroutine continues until all data 
points have been reviewed at which time the subroutine ends 
10 (step 570) . 

Often the beginning and ending of a stroke have 
spurious artifacts which can be removed with little effect on 
the writing's fidelity. These artifacts can be due to the 
writer's unsureness, resulting in wiggling of the pen upon 

15 contact, or due to pen skidding. Fig. 6 illustrates the 

preprocessing method of endpoint filtering which can be used to 
remove these artifacts. 

Stroke contour 610 is made up of a series of data 
points 620. Circle 630, centered at the stroke's first data 

20 point 635, and circle 640, centered at the stroke's last data 
point 645, are used to define the threshold by which points are 
either discarded or saved. In the preferred embodiment, the 
physical radius of this circle is set at 0.5 millimeters. The 
number of data points contained within these circles is 

25 dependent upon the resolution of the digitizer tablet. The 
resolution of a typical tablet is 10 points per millimeter. 
Therefore the threshold, t, given in tablet units is obtained 
by multiplying the desired physical distance (0.5 millimeters) 
by the tablet's resolution (10 points/millimeter) . 

30 Fi 9» 7 is a flow chart for the endpoint filtering 

preprocessing method. The digitizer data is in the form of a 
sequence of coordinate values, x L and y if as a function of time 
(step 710) . The subscript i varies from i=0 to i=nsamps-l 
where nsamps is the total number of data points in a stroke. 

35 The subroutine is initialized by setting i equal to 0 (step 
715) and setting j equal to 1 (step 720) . 

The variable d is calculated (step 730) and compared to 
the pre-selected threshold, t (step 740) . If d is less than t, 
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^indicating that this point is within circle 630, the point 
corresponding to x j , yj is discarded (step 750). The value of 
j is then increased by one (step 760) and the calculation for d 
(step 730) is repeated. This process continues until d is not 
5 less then the threshold value, at which time that value of x j , 
yj is saved as the second data point (step 770). This 
completes the subroutine for endpoint filtering for the 
beginning of a stroke. 

The endpoint filtering method used for the last point 
10 of a stroke is identical to that used for the first point 
except that to initialize the subroutine i is set equal to 
nsamps-1 (step 715) and j is set equal to nsamps-2 (step 720). 

Median filtering is a preprocessing method used in 
the preferred embodiment to reduce the effects of tablet 
15 glitches that result in sudden jumps or discontinuities in the 
data stream. Fig. 8 is a flow chart illustrating this method. 
The digitizer data is in the form of a sequence of coordinate 
values, x i and y if as a function of time (step 810). The 
subscript i varies from i=l to i=nsamps-2 where nsamps is the 
20 total number of data points in a stroke. The subroutine is 
initialized by setting i equal to 1 (step 820). 

In this preprocessing method, a new value for each 
coordinate is calculated based on the median value of three 
successive data points (steps 830 and 840) . The value of i is 
25 then increased by. one (step 850) and the subroutine continues 
until the value of i is equal to the total number of samples 
minus two (step 860) at which time the subroutine is complete 
(step 870) . 

A flow chart of the last preprocessing method used in 
30 the preferred embodiment is illustrated in Fig. 9. The 
digitizer data is in the form of a sequence of coordinate 
values, Xi and y i7 as a function of time (step 910). The 
subscript i varies from i=l to i=nsamps-2 where nsamps is the 
total number of data points in a stroke. The subroutine is 
35 initialized by setting i equal to 1 (step 920) . 

In this method, a linear process is applied to reduce 
noise and smooth the stroke contour. A new value for each x 
and y coordinate is calculated based on the initial values of 
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'* three successive data points (step 930) . The value of i is 
then increased by one (step 940) and the subroutine continues 
until the value of i is equal to the total number of samples 
minus two (step 950) at which time the subroutine is complete 
5 (step 960) . 

Fig. 10A is an original signature. Fig. 10B shows 
the actual points received from the digitizer tablet. Fig. 10C 
is a reconstruction of the signature based upon the points 
shown in Fig. 10B. 

10 Fig. 11A is another original signature. Fig. 11B, as 

in Fig. 10B, shows the actual points received from the 
digitizer tablet and are shown as single points 1110. At this 
point, prior to compression, the signature requires 1088 bytes 
of storage- The points 1120 which have been circled are those 

15 points which have been determined to be critical to the 
reconstruction fidelity. Fig. lie is the reconstructed 
signature based upon the circled points 1120. This signature 
required only 219 bytes of storage, or 2 0 percent of the 
original. The stage one compression reduced the 1088 bytes to 

20 584 bytes, which was further reduced to 219 bytes by applying a 
stage two compression technique. 

Fig. 12A is the original of a sequence of lower case 
letters. As in Fig. 11B, Fig. 12B shows both actual points 
1210 received from the digitizing tablet and points 1220 

25 determined to be critical to the reconstruction fidelity. Fig. 
12C is the reconstruction based upon circled points 1220. The 
original required 2072 bytes, the reconstruction based upon 
stage one compression only required 800 bytes (39 percent of 
the original) , and the reconstruction shown in 12C in which 

30 both stage one and stage two compression was applied required 
301 bytes (15 percent) . 

Fig. 13A-C illustrates that the algorithm 
automatically adjusts to the complexity of the electronic ink. 
Fig. 13A is the original of a straight line which contains 

35 little structure. Fig. 13B shows both actual points 1310 

received from the digitizing tablet and points 1320 determined 
to be critical to the reconstruction fidelity. Fig. 13C is the 
reconstruction based upon circled points 1320 in which only 
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stage one compression has been applied. The reconstruction 
required only 6 bytes and represented a compression to 2 
percent of the original. In this case stage two compression 
was not applied because it increased the number of bytes to 14. 
5 Figs. 14A-E illustrates the trade between compression 

ratio and the fidelity of the reconstruction. Fig. 14A is the 
original of a printed address which required a total of 1808 
bytes. Fig. 14B shows the first level of compression which 
employed a compression ratio similar to that used in Figs, lie 

10 and 12C. After both stage one and stage two compression, this 
reconstruction is 13 percent of the original. Figs. 14C-E show 
gradually increasing levels of compression. The reconstruction 
shown in Fig. 14C, requiring only 9 percent of the original, 
appears quite similar to the original. Figs. 14D-E, although 

15 quite readable, show definite differences from the original. 
The reconstructions in Figs. 14D and 14E require 8 percent and 
7 percent of the original storage space, respectively. 
Although some applications require high fidelity, such as 
signature verification, many applications could tolerate the 

2 0 lower fidelity accompanying the high compression ratios shown 

in Figs. 14D-E. 

Fig. 15 shows an embodiment of the apparatus to 
accomplish the method described in this invention. In the 
preferred embodiment, the computer 1550 is an IBM/PC/AT 
25 compatible with an IBM compatible monitor 1540 and keyboard 
1510. Computer 1550 employs the methods disclosed by this 
invention to compress the electronic ink. Handwritten 
information is input either directly via the hardwired 
digitizer tablet 1520 or via a portable digitizer tablet 1530. 

3 0 Portable digitizer tablets are in common use in many 

industries, including the package delivery industry where an 
electronic signature is obtained upon delivery of a package. 
Depending upon the computing power of tablet 153 0, the 
signature can either be immediately compressed using this 
3 5 invention's compression methods or temporarily stored for 

subsequent downloading to computer 1550. Once the handwritten 
data has been compressed, it can either be stored in computer 
1550 or in separate memory 1560. Monitor 1540 allows the 
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original handwritten data to be compared with the compressed 
data. 

The invention has now been explained with reference to 
specific embodiments. Other embodiments will be apparent to 
5 those of ordinary skill in this art in light of this 

disclosure. It is therefore not intended that this invention 
be limited except as indicated by the appended claims. 
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APPENDIX 

The computer program modules used to compress the 
5 electronic data are given in this appendix. Significant 

program modules and their specific functions are as follows: 

Stage one compression: 

calc_sav.c Function to determine which points in the 
10 signature need to be saved, based on various 

criteria* 

curvature. c Function to calculate the local curvature at a 
point along the contour. 

15 Stage two compression: 

cmpsig2-c Function to compress an array of stage 1 data 

with characteristics of the signature 

verification template data. 
pack_half bytes . c 

2 0 Function to pack the value to be compressed into 

half bytes. 

set_4_bits Function to pack values into the upper or 

lower 4 bits of a byte. 
get_4_bits Function to unpack values from the upper or 
25 lower 4 bits of a byte. 

uncmpsig2.c Function to uncompress an array of data 

compressed by the function cmpsig2.c. 
unpack_half bytes, c 

Function to unpack and reconstruct the value 
30 from half bytes. 



35 

/* CALC_SAV.C 
* 



Function to determine which points in the signature need to 
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15 



20 



25 



30 



35 



15 . 

* be saved, based on various criteria. The results are 

* returned in array save_ptsC. 

* For example, if the ith point of x[],y[] should be saved 

* then save_ptsC[i] = 1, else save_pts[i] = 0; 

* cur_comp returns: 1 (i.e. YES or TRUE) for successful 

scaling 

0 (i.e. NO or FALSE) if there is an 
error 



* input arguments: 

* segpnt, x, y 
* 

* output arguments: 
* 



curvature_threshold , nsamps , numsegs , 



save_j?ts 



*/ 

# include <stdio.h> 
#include <cic.h> 

BOOL calc_sav(curvature_threshold, nsamps, numsegs, segpnt, x, 
y, save_ptsC) 



COUNT 


curvature_threshold ; 


/* 


threshold for curvature*/ 






/* 


compression 


*/ 


COUNT 


nsamps ; 


/* 


number of samples 


*/ 






/* 


(length) in input 


*/ 






/* 


arrays x and y 


*/ 


COUNT 


numsegs ; 


/* 


number of individual 


*/ 






/* 


segments ( strokes ) 


*/ 


COUNT 


segpnt [ ] ; 


/* 


pointers to beg of 


*/ 






/* 


individual strokes 


*/ 


COUNT 


*[]; 


/* 


input array of x 


*/ 






/* 


values 


*/ 


COUNT 


y[]; 


/* 


input array of y 


*/ 






/* 


values 


*/ 


TINY 


save_ptsC[ ] ; 


/* 


contains information 


*/ 






/* 


about what points to 


*/ 






/* 


keep (returned to 


*/ 






/* 


calling program) 


*/ 


COUNT 


i; 


/* 


loop counter 


*/ 


COUNT 


k; 


/* 


loop counter 


*/ 
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COUN 1 


valuel ; 


/* 


temporary variable 


*/ 


uUUli JL 


valuer t 


/* 


temporary variaDie 


*/ 


COUN I 


checkB ; 


/ .A. 
/* 


temporary variable 


*/ 


COUNT 


lower_limit ; 


/* 


defines starting pt 


*/ 






/* 


for curvat compress 


*/ 


^^^^^ 

COUNT 


upper_limit ; 


/* 


defines ending pt for 


*/ 






/* 


curvat compress 


*/ 


BOOL 


curvature ( ) ; 


/* 


function to calculate 


*/ 






/* 


local curvature 


*/ 


COUNT 


size_angle; 


/* 


calculated curvature 


*/ 






/* 


at a point 


*/ 


BOOL 


clockwiseB; 


/* 


YES if clockwise, NO 


*/ 






/* 


otherwise 


*/ 


BOOL 


calc_ dir ; 


/* 


if YES calculate 


*/ 






/* 


dir_angleP 


*/ 


COUNT 


dir_angle; 


/* 


direction of curvature 


*/ 






/* 


bisector 


*/ 



* Initialize the save_ptsc array. 
20 * 

V 

for (i = 0; i < nsamps; ++i) 
{ 

save_ptsC[i] =1; 
25 } 
/* 

* 

* Perform curvature based compression (determines which points 

* to keep of the stroke and saves the information in array 
3 0 * save_ptsC to return to the calling program) . 



*/ 

for (i = 0; i < numsegs; ++i) 
{ 

35 lower_limit = segpnt[i]; 

upper_limit = segpnt[i + 1] ; 
while (lower_limit < upper_limit) 
{ 
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checkB = NO; 

for (k = lower_lirait + 2; k < upper_limit; k += 2) 
{ 

curvature (x, y, (k + lower_limit) / 2, 
5 nsamps r (k - lower_limit) / 2, 

&size_angle , &clockwiseB , &calc_dir , 
&dir_angle) ; 
if (size_angle > curvature_threshold) 
{ 

10 checkB = YES; ' 

break; 
} 

) 

if (checkB) 
15 ( 

value2 = (k + lower_limit) / 2; 

valuel = value2 ; 

) 

else 

20 { 

valuel = segpntfi +1] - 1; 

value2 = (valuel + lower_limit) / 2; 

} 

for (k = lower_limit + 1; k < valuel; ++k) 
25 { 

if (k != value2) 

save_ptsC[k] = 0; 

} 

if (checkB) 
3 0 lower_limit = value2; 

else 

lower_limit = upper_limit ; 



} 



35 



/ 



* Return to calling program with successful completion flag. 
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* — 

*/ 

return (YES) ; 

/* 

5 * . 

*/ 

} /* end of function */ 

/* CALCES AV */ 

/* CURVATURE. C 

io * ! 

* Calculates the curvature, direction of curvature (angle) 

* bisector, and whether the change in angle is clockwise or 

* counter_clockwise at a point defined by function argument 

* "index" into the arrays of coordinates points x[], y[], 
15 * representing the stroke contour. The quantities returned 

* are in terms of the angle range 0-3 60 degrees. 

* If ERROR is detected the BOOL function value is YES (=1) , 

* while for a normal return the BOOL function value is NO 
20 * (=0). 

* 

* CAUTION: when the curvature is 0 (i.e., calculated 

* size_angleP = 0 or close to 0) , the calculation of the 

* direction of the curvature bisector is, of course, 
25 * inherently ambiguous and unreliable. 

* 

*/ 

#include <stdio.h> 
#include <cic.h> 

30 

BOOL curvature(x, y, index, maxpts, lghseg, size_angleP, 
clockwiseBP, calc_dirB, dir_angleP) 
COUNT x[]; /* x coordinate values of */ 

/* character */ 
35 COUNT y[]; /* y coordinate values of */ 

/* character */ 
COUNT index; /* point in x, y to calc. */ 

/* curvature */ 
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10 



15 



20 



25 



/* 
*- 



19 



COUNT 


maxpts ; 


/* 


maximum length of 


*/ 






/* 


arrays x,y 


*/ 


COUNT 


Ighseg ; 


/* 


length of segment for 


*/ 






/* 


curvatur calc 


*/ 


LUUM 1 


*size_angiep ; 


/* 


calculated curvature 


*/ 






/* 


at point index 


*/ 




*clockwiseBP ; 


/* 


YES if clockwise, NO 


*/ 






/* 


otherwise 


*/ 


BOOL 


calc_dirB; 


/* 


if YES calculate 


*/ 






/* 


dir_angleP, otherwise 


*/ 






/* 


otherwise return 


*/ 






/* 


dir_angle = o 


*/ 


COUNT 


*dir_angleP; 


/* 


direction of curvature 


*/ 


{ 




/* 


bisector 


*/ 


COUNT 


plus_point ; 


/* 


stores "index + Ighseg 


"V 


COUNT 


minus_point ; 


/* 


stores "index - Ighseg 


"V 


COUNT 


plus_angle; 


/* 


stores calc direction 


*/ 






/* 


of upper seg. 


*/ 


COUNT 


minus_angle ; 


/* 


stores calc direction 


*/ 






/* 


of lower seg. 


*/ 


COUNT 


dif_angle ; 


/* 


stores angle angle 


*/ 






/* 


difference 


*/ 


COUNT 


cic_atan2 ( ) ; 


/* 


function to calculate 


V 






/* 


angle in the 


*/ 






/* 


range -1800 to 1800 


*/ 






/* 


(degrees x 10) 


*/ 



30 



* Check for errors in specifying the 

* computation. 



range for curvature 



★ — 
*/ 



35 



minus_point = index - Ighseg; 
plus_point = index + Ighseg; 
if (minus_point < 0 } j plus_point >= maxpts) 
{ 

*size_angleP = 0; 
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*clockwiseBP = YES ; 
*dir_angleP = 0; 
return (YES) ; 



/* ERROR - out of bounds, */ 
/* cannot calc. curvature */ 



10 



/* 
*- 



* Calculation of the local curvature (i.e., angle change). 

* Note: the function cic_atan2 returns angle values in the 

* range -1800 to 1800, so it is necessary to divide by 10 to 

* get the desired curvatures. 



15 



20 



25 



30 



35 



plus_angle = cic_atan2 (x[plus_point ] - x[ index], 

y[plus_point ] - y [ index] ) ; 

minus_angle = cic_atan2 (x[ index] - x[minus_point] , 

y[index] - y [minus_point ] ) ; 

dif_angle = plus_angle - minus__angle ; 

if (dif_angle > 1800) 
dif_angle -= 3 600; 

else if (dif_angle < -1800) 
dif_angle += 3 600; 

if (dif_angle < 0) 



{ 

*size_angleP 
*clockwiseBP 



else 



{ 

*size_angleP 
*clockwiseBP 



(-dif_angle +5) / 10; 

YES; /* by convention 

/* clockwise 

/* angle changes are 

/* negative 



(dif_angle +5) / 10; 

NO; /* by convention 

/* counter-clockwise 
/* angle changes are 
/* positive 



*/ 
*/ 
*/ 
*/ 



V 
*/ 
*/ 
*/ 



20 



25 
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* Calculate the direction of the bisector of the local 

* curvature (i.-e., angle change) computed above in the range 

* 0-360 degrees. Note: the cic_atan2 returns angle values in 

* the range -1800 to 1800, so an adjustment is made in the 

5 * code below so that the resulting calculated bisector value 

* is in the range 0-3 60, 

★ 

*/ 

if (calc_dirB) 
10 { 

if (*clockwiseBP == YES) 

*dir_angleP = plus_angle - (900 + (dif_angle - 1) 
/ 2); 

else 

15 *dir_angleP = plus_angle + (900 - (dif_angle + l) 

/ 2); 

if (*dir_angleP < 0) 

*dir_angleP += 3 600; 
*dir_angleP = (*dir_angleP +5) / 10? 
if (*dir_angleP >= 360) 
*dir_angleP -= 360; 

} 

return(NO); /* normal return - no */ 

/* errors detected */ 



V 

) /* end of function */ 



/* CURVATURE. C */ 



/* 

30 CMPSIG2 ♦ C 



* Function to compress an array of data with charcteristics of 

* the signature verification template data. The companion 

* program to uncompress the data is UNCMPSIG2.C. NOTE: 
35 * compression/uncompression is exact (not lossy) and 

* restores the original data exactly. 



* 
* 



cmpsig2 returns: 1 (i.e. YES or TRUE) for successful 
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* compression 

* 0 (i.e. NO or FALSE) if there is an 

error 

★ 

5 * input arguments: 

* numin - number of elements (length) of data array to 

* be compressed 

* array - data array to be compressed 

* maxout - maximum allowed size of the compressed array 
10 * comp_arrayUC[ ] (to prevent overwriting 

* memory) 
* 

* output arguments: 

* num_comp_bytesP - ptr to # elements (each a byte) of 
15 * the compressed data stored in 

* comp_arrayUC[ ] . 

* comp_arrayUC - unsigned byte array containing the 

* compressed representation of the input 

* data in array[]; 



20 



*/ 

# include <stdio.h> 
#include <cic.h> 

25 BOOL cmpsig2 (numin, array, maxout, num_ comp_bytesP, 

comp_arrayUC) 

COUNT numin; /* number of points in input */ 

/* array */ 
COUNT array[]; /* array containing input */ 

3 0 /* data to be compressed */ 

COUNT maxout; /* max number of pts for */ 

/* comp_arrayUC */ 

COUNT *num_comp_bytesP; /* actual #bytes used in */ 

/* comp_arrayUC */ 

35 TBITS comp_arrayUC[] ; /* output array of compressed */ 

/* data */ 

{ 

COUNT i; /* loop index */ 
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COUNT 


num_ sign_ change ; 


/* 


counts number of sign 


*/ 








/* 


changes 


*/ 




PATTlTfTl 

COUNT 


value ; 


/* for general use temporary 


*/ 








/ * 


storage 


*/ 


5 


COUNT 


byte_part ; 


/* 


1 if upper 4 bits of byte, 


*/ 








/* 


and 2 if lower 4 bits of 


*/ 








/* 


byte 


*/ 




TBITS 


*comp_arrayUCP ; 


/* 


ptr to comp_arrayUC of sig. 


.*/ 








/* 


data 


*/ 


10 


TBITS 


byte_maskUC ; 


/* 


byte used to temporarily 


*/ 








/* 


hold compressed data 


*/ 




COUNT 


last_index; 


/* 


saves last index of direct.*/ 








/* 


change 


*/ 




COUNT 


current_signr 


/* 


used in sign change 


*/ 


15 






/* 


determination 


*/ 




COUNT 


last_sign; 


/* 


used in sign change 


*/ 






* 


/* 


determination 


*/ 




BOOL 


set_4_bits ( ) ; 


/* 


function to pack values 


*/ 








/* 


into the upper or lower 


*/ 


20 






/* 


4 bits of a byte 


*/ 




BOOL 


get_4_bits ( ) ; 


/* 


function to unpack values 


*/ 








/* 


from the upper or lower 


*/ 








/* 


4 bits of a byte 


*/ 




BOOL 


pack_halfbytes() ; 


/* 


implements main packing 


*/ 


25 






/* 


logic 


*/ 



/* 



* • 

* Initializations for the compression process. 

* 

30 */ 

*num_comp_bytesP = 0; 
*comp_arrayUCP = comp^arrayUC ; 

/* 

* 

35 * First, save the number of items in the array to be 

* compressed. This value is stored as two bytes. The first 

* byte is the upper 8 bits of the 16 bit COUNT value for 

* numin, and the second byte is the lower 8 bits. 
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* _ 

*/ 

*num_comp_bytesP += 1; 

*comp_arrayUCP++ = (TBITS) ((numin >> 8) & Oxf f ) ; 
5 *num_comp_bytesP += 1; 

*comp_arrayUCP++ = (TBITS) (numin & Oxf f ) ; 

/* 

* - 

* Find the number of sign changes (i.e., when successive 
10 * difference values have different signs) and save to 

* comp_arrayUC. Note that the maximum number of sign changes 

* allowed is 255. The function immediately exits and returns 

* an error flag if > 255 sign changes are detected. 

* _ 

15 */ 

num_sign_change = 0; 

last_sign = array [1] - array [0]; 

for (i = 2; i < numin; ++i) 
{ 

20 current_sign = array [i] - array [i - 1]; 

if ( (current_sign < 0 && last_sign >= 0) J| 
(current_sign >= 0 && last_sign < 0) ) 
{ 

last_sign = current_sign; 

2 5 ++num_s ign_change ; 

) 

} 

if (num_sign_change < 256) 
{ 

3 0 *num_comp_bytesP += l; 

*comp_arrayUCP++ = (TBITS) num_sign_change; 

} 

else 

< 

35 fprintf (stderr, "\n\nCMPSIG — num_sign_change > 

256\n\n") ; 
return (NO) ; 

) 
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/* 

*- 

* Initializations for compression 

* 

5 */ 

byte_part = 1; 
byte_maskUC = 0; 

/* 

* 

10 * Pack the sign of the first value of the input data (i.e., of 

* array [0]) 

* 

*/ 

if (arrayfO] >= 0) 
15 value =1; 

else 

value = 0 ; 

pack_halfbytes( value, &byte_part, &byte_maskUC, 
&comp_arrayUCP, num_comp_bytesP) ; 

20 /* 

* 

* Pack the sign of the first difference value (i.e., of 

* array [1] - array [0]) 

* — - 

25 */ 

last_sign = array [1] - array [ 0] ; 
if (last_sign < 0) 
value = 0; 

else 

3 0 value = 1; 

pack_half bytes (value, &byte_part, &byte_maskUC, 
&comp_arrayUCP, num_comp_bytesP) ; 

/* 

* 

35 * Pack the sign change positions (where successive difference 

* values have different sign) . 

★ — - 

*/ 
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last_index = 0; 
for (i = 2; i < numin; ++i) 
{ 

current_sign = array [i] - array [i - 1]; 
5 if ( (current_sign < 0 && last_sign >= 0) j ] 

(current_sign >= 0 && last_sign < 0) ) 

{ 

last_sign = current_sign; 
if ((i - last_index) > 255) 
10 { 

fprintf (stderr, "\n\nCMPSIG — sign index 

change > 255\n\n") ; 
return (NO) ; 

\ 

15 pack_half bytes (i - last_index, &byte_part, 

&byte_maskUC, &comp_arrayUCP, 
num_comp_bytesP) ; 
last_index = i; 

} 

20 ) /* end of loop to pack sign */ 

/* change position values */ 



* Pack differences values into bytes and save to comp_arrayUC 
25 * 

*/ 

for (i = 0; i < numin; ++i) 
{ 

if (i ==0) 
30 value = abs (array [ 0] ) ; 

else 

value = abs(array[i] - array[i - l]); 
pack_half bytes (value, &byte_part, &byte_maskUC, 
&comp_arrayUCP, num_comp_bytesP) ; 
35 if (*num_comp_bytesP >= maxout) 

{ 

fprintf (stderr, "\n\nCMPSIG — comp_arrayUC too 
large\n\n") ; 
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return (NO) ; 
) 

} /* end of loop to pack */ 

/* difference values */ 

5 /* 

* 

* Finally, if the data ends on an odd number of half bytes, 

* must save one final byte because the last signature data 

* value is in the first half byte of byte_maskUC and hasn't 
10 * yet been saved to comp_arrayUC. 

* — — 

*/ 

if (bytejpart ==2) 
{ 

15 set_4_bits( (UTINY) 15, 2, &byte_maskUC) ; 

*num_comp_bytesP += 1; 
if (*num_comp_bytesP > maxout) 
{ 

fprintf (stderr, "\n\nCMPSIG — comp_arrayUC too 
20 large\n\n") ; 

return (NO) ; 
) 

*comp_arrayUCP++ = byte_maskUC; 
) 

25 /* 

* . _ 

* If this point is reached the compression process has 

* proceeded normally and no errors have been detected, so 

* return with a "successful completion" status flag. 

30 * 

*/ 

return (YES) ; 

* 

*/ 

35 } /* end of function CMPSIG2 */ 
/* PACK_HALFBYTES . C 
* 

. * Function to pack a COUNT value into half bytes. 
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28 



pack_half bytes returns: 1 



PCT/US93/06883 



(i.e. YES or TRUE) for 
successful packing 

(i.e. NO or FALSE) if 
there is an error 



10 



15 



20 



25 



30 



35 



*/ 

BOOL pack_half bytes (value, byte_partP, byte_maskUCP, 
comp_arrayUCPP , num_comp_bytesP) 



/* 

Ac- 



count value; 
COUNT *byte_partP; 



TBITS *byte_maskUCP; 
TBITS **comp_arrayUCPP; 
COUNT *num_comp_bytesP; 
{ 



/* value to be compressed 
/* 1 if upper 4 bits of byte, 
/* and 2 if lower 4 bits of 
/* byte 

/* packed byte representing 
/* different values 
/* ptr to ptr to comp_arrayUC */ 
/* element */ 
/* actual #bytes used in */ 
/* comp_arrayUC */ 

/* temporary value */ 



*/ 
V 
*/ 
V 

*/ 
*/ 



UTINY tempUC; 

UTINY continuation_labelUC;/* used to save continuation*/ 



BOOL get_4_bits() 



BOOL set_4_bits() 



if (value < 14) 



{ 



/* label */ 

/* function to unpack values */ 

/* from the upper or lower 4 */ 

/* bits of a byte */ 

/* function to pack values */ 

/* into the upper or lower 4 */ 

/* bits of a byte */ 



* Pack 4 bit value into byte. When the absolute difference 

* value is < 14, then it can be packed into a half_byte (4 

* bits) . 

* 
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set_4_bits( (UTINY) value, *byte_partP, byte_maskUCP) ; 
if (*byte_partP == 2) 
{ 

*byte_partP = 1; 

5 *num_comp_bytesP += 1; 

**comp_arrayUCPP = *byte_maskUCP; 

++ ( *corap_arrayUCPP) ; 

} 

else 

10 *byte_partP = 2; 

) 

else 

{ 

/* 

15 * 

* Pack 4 bit continuation label into byte. When the absolute 

* difference value is <= 30 (and, of course, >= 14 since the 

* previous block above deals with values < 14) , then pack a 

* value 14 into the next half byte, and write the remainder 
20 * into a subsequent half byte* If the absolute difference 

* value is >= 30, pack 15 into the first half byte and pack 

* the remainder into another byte. If the absolute difference 

* value is > 14 + 255 = 269, then an error has occurred and 

* the function immediately exits and returns an error 
25 * condition. 

* 

*/ 

if (value > 269) 
{ 

30 fprintf (stderr, "\n\nCMPSIG - abs dif value out of 

") ; 

fprintf (stderr, "range (= %d)\n\n", value); 

return (NO); /* return on ERROR detection */ 

) 

35 else if (value < 30) 

continuation_labelUC = (UTINY) 14; 

else 

continuation_labelUC = (UTINY) 15; 



« 
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/* 

* — - 

* First, set half byte for continuation label. 

* 

5 */ 

set_4_bits (continuation_labelUC, *byte_partP, 

byte_maskUCP) ; 
if (*bytejpartP == 2) 

{ 

10 *byte_partP =1; 

*num_comp_bytesP += 1; 
**comp_arrayUCPP = *byte_maskUCP; 
++(*comp_arrayUCPP) ; 

} 

15 else 

*byte_partP = 2 ; 
if (value < 30) 

{ 

/* 

20 * 

* Pack a value , corresponding to (value - 14) into a second 

* half byte (i.e. f a value from 14 to 29 will be converted to 

* a value from 0 to 15 for packing) . 

* 

25 */ 

set_4_bits( (UTINY) (value - 14), *byte_partP, 

byte_maskUCP) ; 
if (*byte_partP == 2) 

{ 

30 *byte_partP = l; 

*num_comp_bytesP += l ; 
**comp_arrayUCPP = *byte_maskUCP; 
++(*comp_arrayUCPP) ; 
) 

35 else 

*byte_partP = 2 ; 

} 

else 
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/* 



* If the original absolute difference value is >= 30, then 
5 * must pack two more half bytes in additional to the 

* continuation half byte. 

* — 

*/ 

if (*byte_partP == 1) 
10 ( 

*byte_maskUCP = (TBITS) (value - 14); 

*num_comp_bytesP += 1 ; 
**comp_arrayUCPP = *byte_maskUCP; 
++(*comp_arrayUCPP) ; 
15 } 

else 
{ 

get_4_bits(&tempUC, 1, (TBITS) (value - 14)); 
set_4_bits (tempUC, 2 , byte_maskUCP) ; 
20 *num_comp_bytesP += l; 

**comp_arrayUCPP = *byte_maskUCP; 
++ (*comp_arrayUCPP) ; 

get_4_bits(&tempUC, 2, (TBITS) (value - 14)); 
set_4_bits((UTINY) tempUC, 1, byte_maskUCP) ; 
25 } 

) 

} 

return (YES); /* value unpacked successfully*/ 
/* v 

30 } /* end of pack_half bytes */ 

/* UNCMPSIG2 . C 

* 

* Function to uncompress an array of data compressed by 

* function CMPSIG2. NOTE: compression/uncompression is exact 
3 5 * (not lossy) and restores the original data exactly. 

* 

* uncmpsig2 returns: 1 (i.e. YES or TRUE) for 

* successful uncompress 
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r * 

* 0 (i.e. NO or FALSE) if 

* there is an error 
* 

* input arguments: 

5 * comp_arrayUC - unsigned byte array containing the 

* compressed representation of the data 

* (as created by CMPSIG2 . C) 
* 

* output arguments: 

10 * numoutP - ptr to number of elements (length) of the 

* uncompressed data 

* array - data array created by unpacking and 

* reconstructing values taken from the 

* compressed array comp_arrayUC[ ] . 

15 * 

*/ 

# include <stdio.h> 
#include <cic.h> 



20 BOOL uncmpsig2 (numoutP, ar 
COUNT * numoutP; 

COUNT array [ ] ; 

25 

TBITS comp_arrayUC [ ] ; 
{ 

30 COUNT i; 

COUNT kount_bytes; 

TBITS/ *comp_arrayUCP; 

35 COUNT num_sign_change; 

COUNT value; 



, comp_arrayUC) 



/* 


number of points in output 


*/ 


/* 


array 


*/ 


/* 


array for output signat 


*/ 


/* 


data that has reconst from 


*/ 


/* 


comp_arrayUC by reversing 


*/ 


/* 


the compression proc. 


*/ 


/* 


compressed input array of 


*/ 


/* 


sig data 


*/ 


/* 


loop index 


*/ 


/* 


kounts bytes used in 


V 


/* 


comp_array 


*/ 


/* 


ptr to comp_arrayUC of sig 


*/ 


/* 


data 


*/ 


/* 


counts number of sign 


*/ 


/* 


changes 


V 


/* 


for general use temporary 


*/ 


/* 


storage 


*/ 
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15 /i 



COUNT 


byte jpart ; 


/* 


1 if upper 4 bits of bvtp 


/ 






/* 


and 0 \ "F 1 OT»T<st- A Kif e /-\ -C 
4V -* -Li- lUWcl *± D1LS OI 


*/ 






/* 


byte 


/ 


TBITS 


byte_maskUC ; 


/* 




V 






/* 


different val npc 


V 


COUNT 


sign change [2 56]; 


/* 


a -*-^»»y ot sign cnange 


*/ 








ind i r*^c; 


*/ 


COUNT 


* s ign_changeP ; 


/* 


pointer to sian chancrp 


* / 






/* 


index array 


*/ 


COUNT 


sign_f irst_dif ; 


/* 


saves sign of first 


*/ 






/* 


difference 


*/ 


COUNT 


current_sign ; 


/* 


current value of sign 


*/ 


BOOL 


unpackjialf bytes ( ) 


? /* 


implements main unpacking 


*/ 






/* 


logic 


*/ 



* Initializations for compressed array 



* — 

*/ 



kount_bytes 
comp_arrayUCP 



= 0; 

= comp_arrayuC ; 



/* 

*- 



* First, get the number of items in the array that was 

* compressed. This value is stored as two bytes. The first 

* byte is the upper 8 bits of the 16 bit COUNT value for 

* numout, and the second byte is the lower 8 bits. 
* 

*/ 



35 



*numoutP 
*numoutP 
*numoutP 
* numout P 
kount_bytes += 2; 
/* 



= 0; 

= *comp_arrayUCP++ ; 
<<= 8; 

= *numoutP | (*comp_arrayUCP++) ; 



* Get the number of sign changes 
* 
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num_sign_change = (COUNT) (*comp_arrayUCP++) ; 
++kount_bytes ; 

/* 

5 * 

* Check for error condition. If the value read in for the 

* number of sign changes is >= 256 , then an error has 

* occurred. 

* 

if (num_sign_change < 0 j j num_sign_change >= 256) 
{ 

fprintf (stderr, "\n\nUNCMPSIG — number of sign 

changes out ") ; 
fprintf (stderr, "of range (= %d) is >= 256\n\n", 

num_sign_change) ; 
return (NO); /* return on ERROR detection */ 

} 

/* 

* . _ 

* Initializations for unpacking. 

* : 

*/ 

byte_part = 1; 
25 /* 

★ — 

* Unpack the sign of the first value of the original, 

* uncompressed data. 

★ 

30 */ 

unpack_hal f by tes ( & value , &byte_part , &byte_maskUC , 

&comp_arrayUCP f &kount_bytes) ; 
if (value ~ 0) 

current_sign = -l; 
35 else if (value == 1) 

current_sign 1; 

else 

{ 



10 



15 
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/* 

* — 



* Error condition since value != 0 (for minus sign) and also 

* != 1 (for plus sign) . 



*/ 








f print f ( stderr , 


"\n\nUNCMPSIG — initial sign value ") ; 




f print f ( stderr , 


"is inconsistent (= %d)\n\n", value); 




return (NO) ; 


/* return on ERROR detection */ 


10 


) 




/* 







* Unpack the sign of the first difference value of the 

* original, uncompressed data. 

15 * 



*/ 

unpack_ha If bytes ( &value , &byte_part , &byte_maskUC , 
& comp_a r r ay U CP , 
&kount_bytes) ; 
20 if (value == 0) 

sign_f irst_dif = -1; 
else if (value == l) 

sign_f irst_dif = l; 

else 

25 { 
/* 

* 



* Error condition since value != 0 (for minus sign) and also 

* != 1 (for plus sign). 



*/ 








fprintf (stderr, 


"\n\nUNCMPSIG — initial sign value ") ; 




fprintf (stderr, 


"is inconsistent (= %d)\n\n", value); 




return (NO) ; 


/* return on ERROR detection */ 


35 


) 




/* 







* Unpack sign change positions (where successive difference 
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'* * values have different sign) . 
★ — — — _ 

*/ 

for (i = 0; i < num_sign_change; ++i) 
5 { 

unpack_half bytes (fcvalue, &byte_part, &bytejnaskUC, 
&comp_arrayUCP, &kount_bytes) ; 

if (i « 0) 

sign_change[0] = value; 
10 else v 

sign_change[i] = sign_change[i - 1] + value; 
} /* end of loop to unpack sign */ 

/*change position values */ 



/* 

15 *- 



20 



* Unpack differences from bytes and read from disk. 
* 

*/ 

sign_changeP = sign_change;/* pointer to sign change */ 

/* index array */ 
for (i = 0; i < *numoutP; ++i) 

{ 

unpackjial f by tes ( Svalue , &byte_part , &byte_maskUC , 
&comp_arrayUCP, &kount_bytes) ; 
25 if (i > 1 && num_sign_change > 0 && i == *sign_changeP) 

{ 

current_sign = -current_sign; 
— num_s ign_change ; 
++s ign_changeP ; 
30 ) 

if (i « 0) 

array [i] = current_sign * value? 
else if (i == 1) 

{ 

35 sign_changeP = sign_change; 

array [i] = array [i - 1] + sign_f irst_dif * value; 

current_sign = sign_f irst_dif ; 

) 



WO 94/03853 



PCT/US93/06883 



37 

else 

array [i] = array [i - 1] + current_sign * value; 
} /* end of loop to unpack and */ 

/* reconstruct data */ 

5 /* 

*- _ _ _ 

* Check for inconsistency in specified size (in bytes) of the 

* input compressed array (comp_array) versus the actual number 

* of bytes used in the uncompress process to create the output 
10 * uncompressed array, 

* 

if (kount_bytes != num_comp_bytes) 

{ 

fprintf (stderr, M \n\nUNCMPSIG kount_bytes (= %d) 
15 i= », kount_bytes) ; 

fprintf (stderr, M num_comp_bytes (= %d)\n\n", 
num_comp_bytes) ; 

return (NO) ; 
} 

20 */ 
/* 

★ _ 

* If this point is reached the uncompression process has 

* proceeded normally and no errors have been detected, so 

25 * return with a " successful completion" status flag. 

* _ _ 

*/ 

return (YES) ; 

/* 

30 ) /* end of function UNCMPSIG2 */ 
/* UNPACK_HALFBYTES . C 
*- — _ 

* Function to unpack and reconstruct a COUNT value from half 

* bytes. 
35 * 

* pack_half bytes returns: 1 (i.e. YES or TRUE) for 

* successful unpacking 

* 0 (i.e. NO or FALSE) if 
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there is an error 



*/ 

BOOL unpack_half bytes (valueP, byte_partP, byte_maskUCP, 
comp_arrayUCPP , kount_by tesP ) 



COUNT *valueP; 

COUNT *byte_partP; 

TBITS *byte_maskUCP; 
TBITS **comp_arrayUCPP ; 
COUNT *kount_bytesP; 

{ 

UTINY tempUC; 
UTINY valueUC; 
BOOL get_4_bits ( ) ; 

BOOL set_4_bits ( ) ; 



/* contains uncompressed */ 
/* value to be returned to */ 
/* main program */ 
/* 1 if upper 4 bits of byte,*/ 
/* and 2 if lower 4 bits of */ 
/* byte */ 
/* packed byte representing */ 
/* different values */ 
/* ptr to comp_arrayUC of sig.*/ 
/* data 

/* kounts bytes used in 
/ * comp_array 



/* temporary value 

/* temporary value 

/* function to unpack values 

/* from the upper or lower 4 

/* bits of a byte 

/* function to pack values 

/* into the upper or lower 4 

/* bits of a byte 



*/ 
*/ 
*/ 

*/ 
V 
*/ 
*/ 

V 
V 
*/ 



* Unpack 4 bits. When the value is < 14 , it corresponds to a 
30 * legitimate difference value, and hence the next array value 

* can be calculated by adding the difference (with the correct 

* sign) to the preceding array value. If the value == 14, the 

* absolute difference value is >= 14 and <= 29, hence another 

* half byte must be read in to compose the absolute 

35 * difference. If the value == 15, the absolute difference 

* value is >= 30 and < 269, hence another full byte (i.e., 2 

* half bytes) must be read in to compose the absolute 

* difference. 
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* 

if (*byte_partP == 1) 

{ 

5 ++(*kount_bytesP) ; 

*byte_inaskUCP = ** compear ray UCPP; 
++ ( * compear rayUCPP) ; 

} 

get_4_bits ( SvalueUC , *byte_partP , *byte_maskUCP) ; 
10 *byte_partP = 3 - *byte_partP ; 

if (valueUC < 14) 

*valueP = (COUNT) valueUC; 
else if (valueUC == 14) 
{ 

15 if (*byte_partP == 1) 

{ 

*byte_maskUCP = **comp_arrayUCPP; 
++(* compear ray UCPP) ; 
++(*kount_bytesP) ; 
20 ) 

get_4_bits(&valueUC, *byte_partP, *byte_maskUCP) ; 
*valueP = (COUNT) (valueUC + 14); 
*byte_partP = 3 - *byte_partP; 

} 

25 else if (valueUC == 15) 

{ 

if (*byte_partP ==1) 
{ 

*byte_maskUCP = **comp_arrayUCPP; 
3 0 ++ ( *comp_arrayUCPP) ; 

++ ( *kount_bytesP) ; 
} 

get_4_bits(&t6inpUC, *byte_partP, *byte_maskUCP) ; 
35 set_4_bits(tempUC, 1, &valueUC) ; 

*byte_partP = 3 - *bytejpartP; 
if (*byte_partP == i) 

{ 
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*byte_jmaskUCP = **comp_arrayUCPP; 
++(*coirip_arrayUCPP) ; 
++(*kount_bytesP) ; 
) 

get_4_bits(&tempUC, *byte_partP, *byte_maskUCP) ; 
set_4_bits ( tempUC , 2 , SvalueUC) ; 
*valueP = (COUNT) (valueUC + 14); 
*byte_partP = 3 - *byte_partP; 
} 

{ 

*byte_maskUCP = **comp_arrayUCPP; 
++(*comp_arrayUCPP) ; ++ (*kount_bytesP) ; 



15 return (YES) ; 
/* 



V 



} 



/* value unpacked successfully 



V 



/* end of function UNPACK_HALFBYTES */ 



20 /* GET_4_BITS.C 




25 



30 



35 



* Function that returns the UTINY value corresponding to 

* either the upper 4 bits in the input byte, or the lower 4 

* bits. Of course this will be a value in the range 0 to 15. 

* The input arguement "position" determines whether the upper 

* or lower 4 bits is taken: 



* 
* 
* 
* 

*- 



position = 1 
position = 2 



upper four bits (or upper 
"nybble") 

lower four bits (or upper 
"nybble") 



*/ 

# include <stdio.h> 
#include <cic.h> 

BOOL get_4_bits(valueUCP, position, byte_maskUC) 

UTINY *valueUCP; /* set the four bits to this */ 

/* value */ 
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COUNT position; /* l specifies the upper 4 */ 

/* bits, and */ 

TBITS byte_maskUC; /* feature mask for template */ 

/* selection 2 specifies the */ 

5 /* lower 4 bits */ 

{ 

/* 

* — _ 

* Return original mask if error condition found 

10 * 

*/ 

if (position != 1 && position != 2) 
return (NO) ; 

/* 

15 * 

* Get 4 bits (or nybble) specified by the input argument 

* "position" 

*- 

*/ 

20 if (position == 1) 

byte_maskUC »= 4 ; 
*valueUCP = (UTINY) (bytejmaskUC & Oxf ) ; 

/* zero the upper four bits */ 

return (YES) ; 

25 /* 

V 

) /* end of function GET_4_BITS.C*/ 

*/ 

30 * SET_4_BITS.C 

* , 

* Function that sets either the upper 4 bits in the input 

* byte, or the lower 4 bits to a value in the range 0 to 15 

* specified by the input argument "value" . 
35 * 

* The input arguement "position" determines whether the upper 

* or lower 4 bits is set: 
* 
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* * "position = 1 — upper four bits (or upper 

* "nybble") 

* position =2 — lower four bits (or upper 

* "nybble") 

5 * . 

* NOTE: function returns a boolean YES for normal operation 

* and NO if an error is detected. 

* 

*/ 

10 #include <stdio.h> 
#include <cic.h> 

BOOL set_4_bits (valueUC, position, byte_maskUCP) 

UTINY valueUC; /* set the four bits to this */ 

/* value */ 
15 COUNT position; /* specifies the position of */ 

/* the two bits */ 
TBITS *byte_maskUCP; /* feature mask for template */ 

/* selection */ 

{ 

20 /* 

* 

* Return original mask if error condition found 

* 

*/ 

25 if (valueUC > (UTINY) 15) 

return (NO) ; 
if (position != 1 && position 1= 2) 
return (NO) ; 

/* 

30 * 

* Set bits as specified by the input arguments 

* 

*/ 

if (position == 1) 
35 { 

*byte_maskUCP &= * (OxfO) ; 

/* zero upper 4 bits inbytejnaskUCP*/ 
*byte_maskUCP j= (valueUC « 4); 
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/* set upper 4 bits to new value */ 



) 



else 



{ 

*byte_maskUCP &= "(Oxf); 

/* zero lower 4 bits in byte_maskUCP*/ 
*byte_maskUCP |= valueUC; 



10 



/* 
/* 



15 



* 

*/ 



) 

return (YES) ; 


/* set 


upper 4 bits to 


new value 


*/ 










} 


/* end 


of function SET_ 


,4_BITS 


V 


This file contains 
constants , macros , 


standard 
and data 


type definitions 
structures . 


, defined 





25 



#def ine 


FAST 


register 


#def ine 


GLOBAL 




#def ine 


IMPORT 


extern 


#def ine 


THISFL 


extern 


#def ine 


INTERN 


static 


#def ine 


LOCAL 


static 


typedef 


void 


VOID ; 


typedef 


char 


TBOOL, TINY, TEXT; 


typedef 


short 


COUNT , SHORT; 


typedef 


int 


INT, ARGINT, METACH ; 


typedef 


long 


LONG; 



typedef unsigned char UTINY, TBITS, UTEXT; 

typedef unsigned short BITS, UCOUNT, USHORT, SIZETYPE; 

typedef unsigned int BYTES, UINT; 

35 typedef unsigned long ULONG , LBITS; 



typedef float FLOAT; 
typedef double DOUBLE; 
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* typedef USHORT ERCTYPE; 

typedef enum 
{ 

5 NO = 0, 

YES = 1 
} BOOL; 

typedef enum 
10 { 

SUCCEED = 0, 

FAIL =1 
) SUCCESS; 

15 #ifndef abs 

#define abs(x) ( ( (x) < 0) ? -(x) : (x)) 

#endif 

#ifndef max 

define max(x,y) (((x) < (y) ) ? (y) : ( X ) ) 
20 #endif 

#ifndef min 

#define min(x,y) (((x) < (y) ) ? (x) : (y)) 
#endif 

25 #define FOREVER while (YES) 



#define EOS ( '\0' ) 
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* WE CLAIM: 

1. A method for electronically compressing 
handwritten data expressible as a sequence of data points, the 
5 method comprising the steps of: 

capturing a parametric representation of data 
handwritten on a digitizer tablet; 

determining local extrema and saving representations 
of said local extrema as first data points; 
10 determining stroke endpoints and saving 4 

representations of said stroke endpoints as second data points; 
and 

calculating curvature locally about selected data 
points between said second data points and saving only selected 
15 representations of said selected data points based on 
curvature . 



20 



30 



35 



2. The method according to claim 1 wherein said 
parametric representation of the digitizer data is a sequence 
of x and y coordinate values in a Cartesian coordinate system 
as a function of time. 



3. The method according to claim 1 wherein said 
local extrema determining step comprises selecting minimum x^ 
25 and y.^ and maximum x^ and y.^ according to the following 
relationships: 

minimum = < x i+1 and x^ < x.^, 
maximum = x^_ > x i+1 and x^ > x i _ 1 , 
minimum = y i < y i+1 and y ± < y i-lf 
maximum = y L > y i+1 and y i > y^, 

where 

X£ and y A over the range i = 0 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 



4 . The method according to claim 1 wherein said 
stroke endpoint determining step comprises selecting a first 
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endpoint of a stroke, x Q and y 0 , and selecting a last endpoint 

of a stroke, x nsa:nps . 1 and Y nsamps , lf 

where 

X£ and over the range i = 0 to i = nsamps-1 is the 
5 discrete representation of a stroke, and 
nsamps is the number of points. 

5. The method according to claim 1 wherein said 
curvature determining step comprises calculating an angle 

10 between adjacent data points according to the relationfehip: 
c i = a i+d ~ a i-d' 

where 

C± = C± - 3 60 degrees if C i > 180 degrees, 
c i = c i + 360 Agrees if < -180 degrees, 
15 a i+d = a r<=tan (^ i+d - x ± , y i+d - y L ) , 

a i-d = arctan (*i - *i- d , Y± - Yi- d ) , 
d is an integer no larger than 4, 

x± and y i over the range i = d to i = nsamps-l-d is 
the discrete representation of a stroke, and 
20 nsamps is the number of points. 

6. The method according to claim 5 wherein said 
curvature angle calculating step further comprises discarding 
data points with a curvature angle less than 5 degrees or 

25 greater than 25 degrees. 

7. The method according to claim 5 wherein said 
curvature angle calculating step further comprises discarding 
data points with a curvature angle less than 25 degrees or 

30 greater than 50 degrees. 

8. The method according to claim 1 further 
including a preprocessing step of discarding duplicate data 
points according to the following relationships: 

35 if Xi = x i _ 1 and Yi = y^, 

then discard x^ and y i# * 
if x L * x i . 1 and y ± # y^, 
then save x^ and Yi ; 
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'where 

and yj_ over the range i = 1 to i = nsamps-1 is the 
discrete representation of a stroke, and 
nsainps is the number of points. 

5 

9. The method according to claim 1 further 
including a preprocessing step of discarding data points 
according to the following procedure: 

set j = 1, 

10 calculate d = { (x Q - Xj) 2 + (y Q - yj) 2 }^ 2 , 

if d < 5, then discard point corresponding to index 
j, set j = j + l, and recalculate d; 

if d > 5, then shift remaining points down so that 
the point (Xj, y j ) corresponds to (x x , y ± ) , and quit the 
15 calculation, 
where 

x i and y± over the range i = 0 to i = nsamps-1 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

20 

10. The method according to claim 1 further 
including a preprocessing step of discarding data points 
according to the following procedure: 

set j = 2, 

25 calculate d = { (x n8anp8 . 1 -x nsamps . j ) 2 + 

( y nsamps-l~ynsamps-j ) * 2 ' 
if d < 5, then discard point corresponding to index 

j, set j = j + 1, and recalculate d; 

if d > 5, then shift remaining points down so that 

30 the point (x nsanps . j; Ynsamps-j) corresponds to (x nsamps _ 2 , 

y nsamps - 2 ) and quit the calculation. 

11. The method according to claim 1 further 
including a preprocessing step of dividing data points between 

35 said first data points into groups of three adjacent points and 
calculating the median value of said group according to the 
following relationships: 

Xj = median value of {Xj_ lf x j , Xj +1 ), and 



# 
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Yj = median value of (Yj^, y j , Yj +1 >. 

where 

x ± and y i over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
5 nsamps is the number of points. 



12. The method according to claim 1 further 
including a preprocessing step of dividing data points between 
said first data points into groups of three adjacent data 
10 points and calculating a representative value for said group 
according to the following relationships: 

Xj = 0. 25x^.3^ + 0.5Xj + 0.25Xj +1 , and 

Yj = O^Sy-j^ + O.Syj + 0.25yj + 1 . 

where 

15 x i and over the range i = 1 to i = nsamps-2 is the 

discrete representation of a stroke, and 
nsamps is the number of points. 



20 



13. An apparatus for electronically compressing 
handwritten data expressible as a sequence of data points, the 
apparatus comprising: 

means for capturing a parametric representation of 
data handwritten on a digitizer tablet; 

means for determining local extrema and saving 
25 representations of said local extrema as first data points; 

means for determining stroke endpoints and saving 
representations of said stroke endpoints as second data points; 
and 

means for calculating curvature locally about 
selected data points between said second data points and saving 
only selected representations of said selected data points 
based on curvature. 



30 



14. The apparatus according to claim 13 wherein said 
35 parametric representation of the digitizer data is a sequence 
of x and y coordinate values in a Cartesian coordinate system 
as a function of time. 
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c * " 15. The apparatus according to claim 13 wherein said 

local extrema determining means comprises means for selecting 
minimum x^ and and means for selecting maximum x^ and 
according to the following relationships: 
5 minimum = < Xj_ +1 and x^ < x^-i* 

maximum = > x i+1 and x i > x i . 1/ 
minimum = y L < y i+1 and y ± < y^ ± , 
maximum = y ± > y i+1 and y± > y^, 

where 

10 and y^ over the range i = 0 to i = nsamps-2 is the 

discrete representation of a stroke, and 
nsamps is the number of points. 

16. The apparatus according to claim 13. wherein said 
15 stroke endpoint determining means comprises means for selecting 

a first endpoint of a stroke, x Q and y 0 , and means for 
selecting a last endpoint of a stroke, x nsainps _ 1 and y^.^ 
where 

x i and y^ over the range i = 0 to i = nsamps-1 is the 
20 discrete representation of a stroke, and 
nsamps is the number of points. 

17. The apparatus according to claim 13 wherein said 
curvature determining means comprises means for calculating an 

25 angle between adjacent data points according to the 
relationship: 

C i = a i+d " a i-d' 

where 

Cj^ = C i - 3 60 degrees if > 180 degrees, 
30 c i = c i + 360 degrees if C i < -180 degrees, 

a i+d = a rctan (x i+d - x if y i+d - y ± ) , 
a i-d = arctan (x A - x i-d , yi - y i-d ) , 
d is an integer no larger than 4, 

x^ and y^ over the range i = d to i = nsamps-l-d is 
3 5 the discrete representation of a stroke, and 
nsamps is the number of points. 
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18. The apparatus according to claim 17 wherein said 
curvature angle calculating means further comprises means for 
discarding data points with a curvature angle less than 5 
degrees or greater than 25 degrees. 

19. The apparatus according to claim 17 wherein said 
curvature angle calculating means further comprises means for 
discarding data points with a curvature angle less than 25 
degrees or greater than 50 degrees. 



10 



15 



20 



20. The apparatus according to claim 13 further 
including a preprocessing means for discarding duplicate data 
points according to the following relationships: 

if x ± = x^-l and yi = y^, 

then discard and y^; 

if x ± # x ±mm± and y A + Yi^, 

then save and y^; 

where 

x ± and y A over the range i = 1 to i = nsamps-1 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 



21. The apparatus according to claim 13 further 
25 including a preprocessing means for discarding data points 
according to the following procedure: 
set j = l, 

calculate d - { (x Q - Xj ) 2 + (y Q - yj ) 2 , h f 
if d < 5, then discard point corresponding to index 
30 j, set j = j + 1, and recalculate d; 

if d > 5, then shift remaining points down so that 

the point (Xj, y - ) corresponds to (x 1; y^ , and quit the 

calculation, 

where 

35 * x.^ and y^^ over the range i = 0 to i = nsamps-l is the 

discrete representation of a stroke, and 
nsamps is the number of points. 
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** ** 22. The apparatus according to claim 13 further 

including a preprocessing means for discarding data points 
according to the following procedure: 
set j = 2, 

5 calculate d = ((Xngaaps.^^) 2 ^ 

(^nsamps-l^nsamps- j ) ^ ' 
if d < 5, then discard point corresponding to index 
j, set j = j + l, and recalculate d; 

if d > 5, then shift remaining points down so that 
10 the point (x nsamps _j, y nsam p S .j) corresponds to (x nsamps _ 2 , 
y nsam p S _2) and quit the calculation. 

23. The apparatus according to claim 13 further 
including a preprocessing means for dividing data points 
15 between said first data points into groups of three adjacent 

points and means for calculating the median value of said group 
according to the following relationships: 

Xj = median value of {Xj_ X/ x j , Xj +1 ), and 
yj = median value of {y^, y j , Yj + 1 >. 

20 where 

x^ and over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

25 24. The apparatus according to claim 13 further 

including a preprocessing means for dividing data points 
between said first data points into groups of three adjacent 
data points and means for calculating a representative value 
for said group according to the following relationships: 

30 Xj = CaSx-j^ + 0.5Xj + 0.25Xj +1 , and 

yj = O^Sy-j^ + 0.5yj + 0.25y j+1 . 

where 

x^ and y± over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
35 nsamps is the number of points. 
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